3  Natural gas

3.1 Utility Service Area Maps

To identify the utilities operating within our 11-county study area, we utilized data sets published by state and federal sources. The outcome of our data collection, yielding a list of distinct utilities for which activity data should be collected, resides within distinct_natGas_util_WI.RDS (4 utilities, all investor-owned) and distinct_natGas_util_type_MN.RDS (7 utilities, six investor-owned and one municipally-owned)

3.1.1 Minnesota

Since there is no state-maintained map of natural gas service territories in Minnesota (“Public Utilities Infrastructure Information for Minnesota n.d.), we looked to the Homeland Infrastructure Foundation-Level Data (HIFLD) database, a product of the Department of Homeland Security’s Geospatial Management Office (DHS GMO) that compiles “foundation-level geospatial data” for homeland security, homeland defense, and emergency preparedness purposes. This dataset was last updated in 2017 (Department of Homeland Security 2017). Though this dataset is national in scope (see fig. X), it is clipped to only include utility service areas within the nine Minnesota counties included in the study area of this inventory (see fig. Y – note that utilities operating across county lines have a polygon covering the extent of their service territory within each and every county they operate within).

3.1.2 Wisconsin

The Public Service Commission of Wisconsin publishes and maintains maps of service territories for natural gas utilities operating within the state. This data set relies upon, and is accurate to, “the extent that various sources [utilities] supplied accurate data.” (Public Service Commission of Wisconsin 2021) This dataset spans the whole state of Wisconsin (see fig, X), and both clipped to the two Wisconsin counties included in the study area of this inventory (see fig. Y)

for electric… as well as the full service territories beyond the two-county area to facilitate estimation and modeling of energy demand (see fig. Z)

For electric..Because Wisconsin does not require utilities to report the amount of energy, both the extent of utilities’ operations within our study area (St Croix and Pierce counties within Wisconsin) as well as their statewide service territories represent important inputs to our methodology for estimating both number of customer accounts and energy

inscope

3.2 Utility Activity Data

In this section, we will describe the acquisition of utility activity data. Analysis process and results are described in the following chapter.

3.2.1 Minnesota

Under Minnesota Administrative Rules Chapter 7610 (Minnesota Department of Commerce, n.d.), utilities are required to file an annual data report that supports the identification of “emerging energy trends based on supply and demand, conservation and public health and safety factors, and to determine the level of statewide and service area needs.” (Minnesota Department of Commerce 2022) This includes a report of county-level energy deliveries (reported in thousand cubic feet, commonly written as mcf). Because the information is structured in this manner, natural gas emissions at the county-level can be estimated as a direct function of energy deliveries to counties reported by utilities, which isn’t the case in Wisconsin (some modeling and estimation is required in WI).

#bar graph with utility county data?

#some data exists for MN customer counts… could be worthwhile to calculate account level average for comparison to WI

3.2.2 Wisconsin

Under Wis. Stat. § 196.07, investor-owned natural gas utilities operating within the state of Wisconsin must submit annual reports to the State which include an array of information related to utility finance and operations, including key figures leveraged in our data collection, such as total energy deliveries made (in units of therms) and total number of customer accounts within each county (Legislature, n.d.). Because all four “in-scope” natural gas utilities in Wisconsin are investor-owned, we were able to estimate county-wide emissions from natural gas in 2021 by first calculating the proportion of each utility customer’s residing within Pierce and St. Croix counties, and allocating that proportion of the utility’s total reported energy delivery to each county. This approach represents a divergence from our Minnesota process, which involves aggregating county-level numbers directly reported by utilities, and implicitly assumes that customer accounts across counties within the operations of a given utility have the same average per-account demand for energy, when in actuality this is likely impacted by land-use mix and relative magnitude/scale of residential and commercial/industrial utility accounts within a given county (citation needed?)

#histogram or other visualization of customer

For electric… justifying modeling approach (reference to mean, use of a weighted average of per-account numbers for all utilities where we have actual and estimated per-account figures. All an estimation.

For electric.. more detail on how population in service territories was used to 1) estimate customer/account counts, 2) allocate total utility numbers based on estimated accounts within the in-scope area, and 3) use estimated per-customer energy deliveries, alongside population figures and total customer counts within and without our study area for the two utilities that didn’t energy activity to the state or federal government, to model energy delivery –> customers/cus

Data source description, type

  • Quality rank (See Table C.2)
  • How, when, and why was the data collected?
  • If this is a modeled dataset, what is the sample?
  • What is the raw unit of measurement?
  • How was this data accessed? Include any relevant links/citations, code, or downloads.
  • What data cleaning or wrangling was completed? How did you test these processes and outputs?
  • What is the geographic and temporal scope? Did you complete any aggregation?
  • What version is the data? Were there other versions available? If so, why did you choose this version?
  • What assumptions are made when we use this dataset?
  • Which subject matter expert (SME) reviewed this data?
  • Describe testing used to verify data

Be sure to add a citation of this dataset to the Zotero shared library.

3.2.2.0.1 Data characteristics
  • Were there any missing data? How did you handle missing data?
  • Plots, tables, and description of data distribution
  • Variance, Z-Score, quantiles
  • Facet views by categorical variables
3.2.2.0.2 Limitations
  • Usually only samples county and state roads, primary arterials
  • Not every site is sampled every year

Additionally, aggregating

3.2.2.1 Vehicle distribution by weight

Introduction text Data source description, type

  • Quality rank (See Table C.2)
  • How, when, and why was the data collected?
  • If this is a modeled dataset, what is the sample?
  • What is the raw unit of measurement?
  • How was this data accessed? Include any relevant links/citations, code, or downloads.
  • What data cleaning or wrangling was completed? How did you test these processes and outputs?
  • What is the geographic and temporal scope? Did you complete any aggregation?
  • What version is the data? Were there other versions available? If so, why did you choose this version?
  • What assumptions are made when we use this dataset?
  • Which subject matter expert (SME) reviewed this data?
  • Describe testing used to verify data

Be sure to add a citation of this dataset to the Zotero shared library.

3.2.2.1.1 Data characteristics
  • Were there any missing data? How did you handle missing data?
  • Plots, tables, and description of data distribution
  • Variance, Z-Score, quantiles
  • Facet views by categorical variables
3.2.2.1.2 Limitations

3.2.3 Data dictionaries